Dependency-Based Phrase Alignment
نویسندگان
چکیده
Phrase alignment is the task that requires the constituent phrases of two halves of a bitext to be aligned. In order to align phrases, one must discover them first and this article presents a method of aligning phrases that are discovered automatically. Here, the notion of a ‘phrase’ will be understood as being given by a subtree of a dependency-like structure of a sentence called linkage. To discover phrases, we will make use of two distinct, language independent methods: the IBM-1 model (Brown et al., 1993) adapted to detect linkages and Constrained Lexical Attraction Models (Ion & Barbu Mititelu, 2006). The methods will be combined and the resulted model will be used to annotate the bitext. The accuracy of phrase alignment will be evaluated by obtaining word alignments from link alignments and then by checking the F-measure of the latter word aligner.
منابع مشابه
Statistical Phrase Alignment Model Using Dependency Relation Probability
When aligning very different language pairs, the most important needs are the use of structural information and the capability of generating one-to-many or many-to-many correspondences. In this paper, we propose a novel phrase alignment method which models word or phrase dependency relations in dependency tree structures of source and target languages. The dependency relation model is a kind of...
متن کاملLinguistically-motivated Tree-based Probabilistic Phrase Alignment
In this paper, we propose a probabilistic phrase alignment model based on dependency trees. This model is linguistically-motivated, using syntactic information during alignment process. The main advantage of this model is that the linguistic difference between source and target languages is successfully absorbed. It is composed of twomodels: Model1 is using content word translation probability ...
متن کاملTranslation reranking using source phrase dependency features
We describe a N-best reranking model based on features that combine sourceside dependency syntactical information and segmentation and alignment information. Specifically, we consider segmentation-aware ”phrase dependency” features.
متن کاملHMM Word-to-Phrase Alignment with Dependency Constraints
In this paper, we extend the HMM wordto-phrase alignment model with syntactic dependency constraints. The syntactic dependencies between multiple words in one language are introduced into the model in a bid to produce coherent alignments. Our experimental results on a variety of Chinese–English data show that our syntactically constrained model can lead to as much as a 3.24% relative improvemen...
متن کاملFeature Engineering in Persian Dependency Parser
Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...
متن کاملStructural Phrase Alignment Based on Consistency Criteria
In this paper, we propose a new method for phrase alignment using a dependency type distance and a distance-score function. With this method, appropriate correspondences can be selected among correspondence candidates that often include ambiguous or incorrect ones. Furthermore, this method makes it possible to measure the overall alignment consistency. We conduct an alignment experiment using 5...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006